Grapheme-to-Phoneme Conversion for Korean Point-of-Interest Data Using Prosodic Words
نویسندگان
چکیده
Point-of-Interest (POI) data are location enriched word data, which consist of personal names, place names including administrative districts, and names of business and recreational locations. They are essential to the development of applications such as Telematics, wireless location-based services (LBS), and tourism and other business applications. Being a major component of Text-to-Speech (TTS) systems and automatic speech recognition (ASR) systems, a Grapheme-to-Phoneme (G2P) conversion system maps the orthography to its corresponding sequence of phonemes. When POI data are converted into corresponding pronunciations, two characteristics of the data need to be considered. First, they are proper names, which are known to be closely related to irregular pronunciations in Korean. Second, most POI data are compound or complex words consisting of more than two words. A constituent of a POI word belongs to many other words. Therefore, it is a prerequisite to separate each word into its constituents in order to derive its pronunciation which includes irregular pronunciations. In this paper, we propose a G2P conversion system that detects irregular pronunciations for Korean POI data, by adopting the notion of prosodic word based on Prosodic Phonology (Selkirk 1984). The performance of the proposed system depends on the effective detection of irregular pronunciation words. According to our proposed system, all the words in the POI data were broken down into prosodic words, which were trained by category (regular, irregular or referential). The trained words were then used to create a dictionary for each category. By adopting this proposed system, irregular pronunciation words could be detected with increased efficiency using a far lesser number of words than other methods. For detecting words of irregular pronunciation, only 22.2% of input data on average were required for manual review for our proposed system, compared to 87% required when using the exceptions detection approach without using prosodic word as unit, and to 100% required when using the regular data-driven approach. The results of our experiment showed that 11.43% of 300k POI data were detected as being words of irregular pronunciation. Compared to the 6.67% obtained in a text corpus, POI data consisting of proper names comprise a much higher percentage of irregular pronunciations than ordinary text corpus. By combining phonology with computational modeling, the proposed system not only provides enhanced performance but also, assuming that phonology is a subsystem of the mind/brain (Durand & Laks 2002) or a branch of the study of mental representation, the psychology of mind (Hale & Reiss 2000), reflects the same method by which a speaker derives pronunciation of morphologically complex Korean proper names..
منابع مشابه
A Korean grapheme-to-phoneme conversion system using selection procedure for exceptions
Cultural, social, economic and other various environmental factors affect our language, and different words and terminology are used and coined for different contexts, which triggers quantitative change of vocabulary of a language. This paper presents a Korean grapheme-to-phoneme conversion system using a selection procedure for exceptions from added text corpus, which reflects such dynamic nat...
متن کاملRule-based Korean Grapheme to Phoneme Conversion Using Sound Patterns
Grapheme-to-phoneme conversion plays an important role in text-to-speech applications and other fields of computational linguistics. Although Korean uses a phonemic writing system, it must have a grapheme-to-phoneme conversion for speech synthesis because Korean writing system does not always reflect its actual pronunciations. This paper describes a grapheme-to-phoneme conversion method based o...
متن کاملSolving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach
To achieve high quality output speech synthesis systems, data-driven grapheme-to-phoneme (G2P) conversion is usually used to generate the phonetic transcription of out-of-vocabulary (OOV) words. To improve the performance of G2P conversion, this paper deals with the problem of conflicting phonemes, where an input grapheme can, in the same context, produce many possible output phonemes at the sa...
متن کاملPhonology of Exceptions For Korean Grapheme-to-Phoneme Conversion
Being an essential part of a Korean speech recognition system and a Text-To-Speech (TTS) system, a Korean Grapheme-toPhoneme conversion system is generally composed of a set of regular rules and an exceptions dictionary [1, 2, 3]. The exceptions have been recorded in the dictionary in a simple and random manner, whereas the researches on the regular rules have been actively progressed. This pap...
متن کاملUnlimited Vocabulary Grapheme to PhonemeConversion with Probabilistic Phrase Break Detection
This paper describes a grapheme-to-phoneme conversion method using phoneme con-nectivity and CCV conversion rules with probabilistic phrase break detection. The method consists of mainly four modules including phrase-break detection, morpheme normalization, morpheme to phoneme conversion and phoneme connectivity check. In the experiments with a test corpus of 210 sentences, we achieved 85% of p...
متن کامل